ADAPTIVE SEARCH METHOD IN FEATURE VECTOR SPACE 

This application is a complete application filed under 35 U.S.C. § 111(a) and claims, 
pursuant to 35 U.S.C. § 119(e)(1), benefit of the filing date of Provisional Application Serial 
No. 60/248,012 filed November 14, 2000 pursuant to 35 U.S.C. § 111(b). The Provisional 
Application Serial No. 60/248,012 is incorporated herein by reference. Additionally, this 
application claims priority from Korean Application No. 00-79181 filed December 20, 2000, 
which is also incorporated herein by reference. 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to a method of searching a feature vector 
space for a feature vector that has similar features to a query vector. More specifically, the 
method of the present invention provides a method for efficiently searching a vector space 
indexed based on an approximation for a feature vector having features similar to a query 
vector according to a varying distance measurement. 

2. Description of the Related Art 

In a multimedia database related to a multimedia application, the contents are 
typically represented by feature vectors. Similarities among objects are determined by a 
distance measurement defined by feature distances between the query vector and feature 
vectors in a feature vector space. 

To provide more precise retrievals, a distance measurement may be iteratively 
performed using collected information such as user feedback. However, a conventional 
search method does not consider how to iteratively perform a distance measurement 
according to varying factors in a large database. In particular, a conventional indexing 
method in a feature vector space has not addressed how to quickly perform a search in an 
environment where a distance measurement is changing, such as on-line retrieval. Thus, 
there still remains a need for accelerating a search in an environment where a distance 
measurement is varying. 



SUMMARY OF THE INVENTION 

To solve the above problems, it is an objective of the present invention to provide a 
method for quickly and iteratively searching an approximated feature vector space for a 
feature vector similar to a query vector according to varying measurement conditions. 

Accordingly, to achieve the above objective, the present invention provides a method 
for adaptively searching a feature vector space which includes the steps of (a) performing a 
similarity measurement on a given query vector within a feature vector space, and (b) 
applying search conditions limited by the result of the similarity measurement obtained in 
step (a) and performing a changed similarity measurement on the given query vector. 

Preferably, step (b) includes the steps of (b-1) obtaining candidate approximation 
regions by performing approximation level filtering according to a distance measurement 
limited by the result of the similarity measurement obtained in step (a), and (b-2) performing 
data level filtering on the obtained candidate approximation regions. 

Preferably, step (a) includes the steps of (a-1) obtaining a predetermined number of 
nearest candidate approximation regions by measuring the distances between the query vector 
and approximation regions, and (a-2) obtaining K nearest neighbor feature vectors by 
measuring the distance between each of all feature vectors in the obtained candidate 
approximation regions and the query vector, where K is a positive integer. 

Preferably, step (b-1) includes the steps of (b-1-1) calculating, according to a changed 
distance measurement, the K'-th shortest distance for the K nearest neighbor feature vectors 
obtained according to the previous distance measurement, where K' is a positive integer, 
and setting the calculated distance as r^u_{t+1}, and (b-1-2) calculating, according to the 
changed distance measurement, the K'-th smallest lower bound limit for the predetermined 
number of candidate approximation regions obtained based on the previous distance 
measurement, and setting it as φ^u_{t+1}. 

Preferably, step (b-1) also includes the following steps of: (b-1-3a) measuring a 
distance L_i(W_{t+1}) between the lower bound limit of an approximation region and a query 
vector for a new distance measurement, wherein N is a positive integer denoting the number 
of objects in the feature vector space and i is a variable ranging from 1 to N; (b-1-4) 
comparing the distance L_i(W_{t+1}) obtained in the step (b-1-3a) with a minimum value 
min(φ, r^u_{t+1}, φ^u_{t+1}) of the K-th smallest upper bound limit φ, r^u_{t+1}, and φ^u_{t+1}; 
(b-1-5) if the distance L_i(W_{t+1}) is less than or equal to the minimum value 
min(φ, r^u_{t+1}, φ^u_{t+1}), setting the corresponding approximation region as a candidate 
approximation region; and (b-1-6) if the distance L_i(W_{t+1}) is greater than the minimum 
value min(φ, r^u_{t+1}, φ^u_{t+1}), excluding the corresponding approximation region. 

Additionally, step (b-1) further includes (b-1-3b) measuring a distance U_i(W_{t+1}) 
between the upper bound limit of an approximation region and the query vector for the new 
distance measurement, assuming that N is a positive integer denoting the number of objects 
in the feature vector space and i is a variable ranging from 1 to N, and (b-1-7) updating the 
K-th smallest upper bound limit φ based on the distance U_i(W_{t+1}). 

Furthermore, steps (b-1-1) - (b-1-6) are repeated until the approximation level 
filtering is performed on N approximation regions, where N is a positive integer denoting the 
number of objects in a database. 

Preferably, step (b-2) further includes the steps of (b-2-1) performing a distance 
measurement between each of all feature vectors in the candidate approximation regions and 
the query vector, and (b-2-2) determining nearest neighbor feature vectors as retrieved 
vectors depending on the result of the distance measurements performed in the step (b-2-1). 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objective and advantages of the present invention will become more 
apparent by describing in detail preferred embodiments thereof with reference to the attached 
drawings in which: 

FIGS. 1A and 1B are flowcharts showing main steps of a method for adaptively 
searching a feature vector space according to an embodiment of the present invention; and 
FIG. 2 is a pseudo code list for explaining approximation level filtering. 

DETAILED DESCRIPTION OF THE INVENTION 

The main steps of an adaptive search method according to an embodiment of the 
present invention will now be described with reference to FIGS. 1A and 1B. A database in 
which multimedia contents are stored is represented as a feature vector space. In this 
embodiment, the feature vector space is approximated with a plurality of hypercubes. 
Furthermore, assuming that M is a positive integer denoting the dimensionality of feature 
vectors used to describe an image/video object, and N is a positive integer denoting the 
number of objects in the database, a feature vector F_i and a feature vector Q of a query object 
are defined as F_i = [F_{i1}, F_{i2}, ..., F_{iM}] and Q = [q_1, q_2, ..., q_M], respectively. Here, the 
database is represented as a feature vector space and the feature vector Q of a query object is 
hereinafter called a query vector. 

First, a predetermined number of nearest candidate hypercubes are obtained by 
measuring the distance between a query vector and each of the hypercubes (step 102). Then, K 
nearest neighbor feature vectors are obtained by measuring the distance between the query 
vector and each feature vector in the predetermined number of candidate hypercubes obtained 
in the step 102, where K is a positive integer (step 104). The distance between the query 
vector and each of the feature vectors is measured by calculating a weighted Euclidean 
distance. The weighted Euclidean distance is calculated by Equation (1): 

d(W_t, F_i, Q) = (Q - F_i)^T W_t (Q - F_i)    ...(1) 

where W_t is a full symmetric function matrix at iteration t and is updated at every iteration. 
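
As a minimal sketch of Equation (1), the quadratic form can be evaluated directly from its definition. The function name weighted_distance and the list-of-rows representation of the matrix are illustrative assumptions and are not part of the described embodiment.

```python
def weighted_distance(W, F, Q):
    """Evaluate (Q - F)^T W (Q - F) as written in Equation (1).

    W is assumed to be an M x M matrix given as a list of rows;
    F and Q are assumed to be length-M lists of numbers.
    """
    diff = [q - f for q, f in zip(Q, F)]
    n = len(diff)
    return sum(diff[a] * W[a][b] * diff[b] for a in range(n) for b in range(n))


identity = [[1.0, 0.0], [0.0, 1.0]]   # unweighted case
stretched = [[4.0, 0.0], [0.0, 1.0]]  # first feature weighted more heavily
print(weighted_distance(identity, [0.0, 0.0], [1.0, 2.0]))   # 5.0
print(weighted_distance(stretched, [0.0, 0.0], [1.0, 2.0]))  # 8.0
```

Changing the matrix between iterations changes the ranking of the same feature vectors, which is why the search conditions must be re-evaluated after each feedback round.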

Then, for example, the user selects a plurality of multimedia contents similar to those 
that he or she desires to find among calculated multimedia contents and attempts a search 
again. Thus, feedback for changed search conditions can be provided from the user, which is 
called relevance feedback. According to the present invention, features for which feedback is 
provided from the user are reflected in a distance measurement for the next search, thereby 
changing distance measurement conditions. 

According to the present invention, approximation level filtering is performed using 
information from the previous iteration t. W_t, C_t(W_t), and R_t denote, respectively, the 
distance measurement function used in the previous iteration t, the approximation regions 
(hypercubes in this embodiment) that passed the previous iteration t, and the vectors 
retrieved using W_t. 




FIG. 2 shows a pseudo code list for explaining the step of approximation level 
filtering. The approximation level filtering is performed using the information from the 
previous iteration t. Referring to FIG. 2, according to the pseudo codes, during the 
approximation level filtering, the K'-th shortest distance is calculated, according to the 
changed distance measurement, for the K nearest neighbor feature vectors obtained based on 
the previous distance measurement, where K' is a positive integer, and the calculated distance 
is set as r^u_{t+1} (step 106). Furthermore, the K'-th smallest lower bound limit is calculated, 
according to the changed distance measurement, for the predetermined number of candidate 
hypercubes obtained according to the previous distance measurement, and is set as φ^u_{t+1} 
(step 108). 
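
The calculation of r^u_{t+1} in step 106 can be sketched as follows: the vectors retrieved in the previous iteration are re-measured with the changed matrix, and the K'-th shortest of those distances is kept. Because the previous neighbors are a subset of the database, the K'-th nearest vector of the whole database under the changed measurement cannot be farther than this value. The names kth_shortest_distance, prev_neighbors, and W_new are hypothetical, and the sketch assumes K' does not exceed the number of previous neighbors.

```python
def weighted_distance(W, F, Q):
    """Quadratic-form distance (Q - F)^T W (Q - F) of Equation (1)."""
    diff = [q - f for q, f in zip(Q, F)]
    n = len(diff)
    return sum(diff[a] * W[a][b] * diff[b] for a in range(n) for b in range(n))


def kth_shortest_distance(prev_neighbors, Q, W_new, k_prime):
    """Return the k_prime-th shortest distance among the previous neighbors,
    measured with the changed matrix W_new (assumed 1 <= k_prime <= K)."""
    if not 1 <= k_prime <= len(prev_neighbors):
        raise ValueError("k_prime must be between 1 and the number of previous neighbors")
    distances = sorted(weighted_distance(W_new, F, Q) for F in prev_neighbors)
    return distances[k_prime - 1]


R_t = [[1.0, 0.0], [0.0, 2.0], [3.0, 3.0]]  # vectors retrieved at iteration t
W_next = [[9.0, 0.0], [0.0, 1.0]]           # changed distance measurement
print(kth_shortest_distance(R_t, [0.0, 0.0], W_next, 2))  # 9.0
```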

Then, the distance L_i(W_{t+1}) between each of the lower bound limits of hypercubes in 
the feature vector space and a query vector is measured according to the changed new 
distance measurement. Additionally, the distance U_i(W_{t+1}) between each of the upper bound 
limits of the hypercubes in the feature vector space and the query vector is measured 
according to the changed new distance measurement as well (step 110). The measurements 
are done assuming that N is a positive integer denoting the number of objects or 
approximation regions in the approximated feature vector space, that is, the number of 
hypercubes. Additionally, i is assumed to be a variable ranging from 1 to N. Furthermore, the 
K'-th smallest upper bound limit φ is calculated (step 112). 
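
The lower and upper bound distances of step 110 can be illustrated under a simplifying assumption. The embodiment uses a full symmetric matrix, for which the bounds are harder to compute; the sketch below assumes instead a diagonal matrix with non-negative weights, so that each dimension can be handled independently. The function name box_bounds and its parameters are hypothetical.

```python
def box_bounds(low, high, Q, weights):
    """Return (L, U): the smallest and largest weighted distance between
    the query Q and any point of the hypercube [low, high].

    Assumption for this sketch: the distance matrix is diagonal with
    non-negative entries given by `weights`.
    """
    lower = 0.0
    upper = 0.0
    for lo, hi, q, w in zip(low, high, Q, weights):
        # Gap to the nearest point of the interval (0 if q lies inside it).
        nearest_gap = max(lo - q, 0.0, q - hi)
        # Gap to the farther end of the interval.
        farthest_gap = max(abs(q - lo), abs(q - hi))
        lower += w * nearest_gap ** 2
        upper += w * farthest_gap ** 2
    return lower, upper


print(box_bounds([0.0, 0.0], [1.0, 1.0], [2.0, 0.5], [1.0, 4.0]))  # (1.0, 5.0)
```

Every feature vector inside the hypercube then has a distance to the query between L and U, which is the property the filtering step relies on.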

Next, the distance L_i(W_{t+1}) between the lower bound limit of the i-th hypercube in the 
corresponding vector space and the query vector is compared with a minimum value 
min(φ, r^u_{t+1}, φ^u_{t+1}) of the K'-th smallest upper bound limit φ calculated in the step 112, 
r^u_{t+1}, and φ^u_{t+1} (step 114). If the distance L_i(W_{t+1}) is less than or equal to the minimum 
value min(φ, r^u_{t+1}, φ^u_{t+1}), the relevant hypercube is set as a candidate hypercube (step 116) 
and, if not, the relevant hypercube is excluded (step 118). 



Referring to pseudo code 202 in FIG. 2, it is determined whether or not the distance 
L_i(W_{t+1}) between the lower bound limit of the i-th hypercube in the corresponding vector 
space and the query vector is smaller than each of the K'-th smallest upper bound limit φ, 
r^u_{t+1}, and φ^u_{t+1}. If so, the relevant hypercube P_i is selected as a candidate hypercube as 
shown in pseudo code 204. Referring to pseudo code 206, if the requirements shown in the 
pseudo code 202 are satisfied, the relevant hypercube P_i is selected as a candidate hypercube, 
and the upper bound limit φ is updated referring to the distance U_i(W_{t+1}) (step 120). 

Next, assuming that N is a positive integer denoting the number of objects in the 
database or hypercubes, it is determined whether i reaches N (step 124). If i does not reach 
N, the steps 114 - 124 are repeated until the approximation level filtering is performed on N 
hypercubes. 
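
The loop of steps 114 through 124 can be sketched as below. This is not the pseudo code of FIG. 2, which is not reproduced here; it is an illustration that assumes the bounds L_i and U_i have already been computed for every hypercube, that φ starts at infinity, and that φ is maintained as the K'-th smallest upper bound among the candidates seen so far. The function name and parameters are hypothetical.

```python
import heapq
import math


def approximation_level_filter(bounds, k_prime, r_u, phi_u):
    """Return indices of hypercubes kept as candidates.

    bounds  : list of (L_i, U_i) pairs under the changed measurement
    r_u     : threshold from the previous neighbors (step 106)
    phi_u   : threshold from the previous candidate hypercubes (step 108)
    """
    smallest_uppers = []  # max-heap (negated) of the k_prime smallest U_i so far
    phi = math.inf        # k_prime-th smallest upper bound seen so far
    candidates = []
    for i, (lower, upper) in enumerate(bounds):
        if lower <= min(phi, r_u, phi_u):      # step 114
            candidates.append(i)               # step 116
            if len(smallest_uppers) < k_prime:
                heapq.heappush(smallest_uppers, -upper)
            elif upper < -smallest_uppers[0]:
                heapq.heapreplace(smallest_uppers, -upper)
            if len(smallest_uppers) == k_prime:
                phi = -smallest_uppers[0]      # step 120
        # otherwise the hypercube is excluded (step 118)
    return candidates


regions = [(0.0, 1.0), (0.5, 2.0), (3.0, 4.0), (1.5, 5.0)]
print(approximation_level_filter(regions, 2, math.inf, math.inf))  # [0, 1, 3]
print(approximation_level_filter(regions, 2, 1.0, math.inf))       # [0, 1]
```

In the second call, the threshold carried over from the previous iteration removes a hypercube that the running upper bound alone would have kept, which is the reduction in candidates the method aims for.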

According to the method described above, for a hypercube to be set as a candidate 
hypercube, the hypercube must meet new requirements determined from the previous 
distance measurement information such as the pseudo code 202. Thus, requirements for 
selecting candidate hypercubes are further limited, thereby reducing the number of selected 
candidate hypercubes. 

Data level filtering is then performed. During the filtering, a distance measurement 
between each of all feature vectors in the candidate hypercubes and the query vector is 
performed (step 126) to determine K' nearest neighbor vectors as found feature vectors 
depending on the result of the distance measurements performed in the step 126, thereby 
completing a search (step 128). In this case, the number of candidate hypercubes is reduced, 
which reduces the computational complexity in measuring the distance between each feature 
vector in the candidate hypercubes and the query vector. Thus, the search speed can be improved 
when searching for a feature vector having features similar to a query vector. 
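
Data level filtering (steps 126 and 128) can be sketched as an exact scan restricted to the candidate hypercubes. The representation of each candidate hypercube as a list of the feature vectors it contains, and the name data_level_filter, are assumptions made for illustration.

```python
import heapq


def data_level_filter(candidate_regions, Q, W, k_prime):
    """Return the k_prime feature vectors nearest to Q among all vectors
    stored in the candidate hypercubes, using the distance of Equation (1)."""

    def distance(F):
        diff = [q - f for q, f in zip(Q, F)]
        n = len(diff)
        return sum(diff[a] * W[a][b] * diff[b] for a in range(n) for b in range(n))

    # Step 126: measure every vector in the candidate hypercubes.
    vectors = [F for region in candidate_regions for F in region]
    # Step 128: keep the k_prime nearest as the retrieved vectors.
    return heapq.nsmallest(k_prime, vectors, key=distance)


regions = [[[0.0, 0.0], [5.0, 5.0]], [[1.0, 1.0], [2.0, 0.0]]]
identity = [[1.0, 0.0], [0.0, 1.0]]
print(data_level_filter(regions, [0.0, 0.0], identity, 2))  # [[0.0, 0.0], [1.0, 1.0]]
```

The cost of this scan grows with the number of vectors in the candidate hypercubes, so fewer candidates from the approximation level directly reduce the work done here.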

Additionally, if new approximation regions are included, the database can be updated faster. 




Although the preferred embodiments of this invention have been described with 
reference to the example that the feature vector space is partitioned into hypercubes and 
approximated, the invention is also applicable to feature vector spaces indexed by other 
known index structures such as R-tree, R*-tree, SR-tree and X-tree. It will be understood by 
those skilled in the art that various changes in form and details may be made therein without 
departing from the spirit and scope of the invention as defined by the appended claims. 

The search method according to the present invention can be written as a program 
executed on a personal or server computer. The program codes and code segments 
constructing the program can be easily inferred by computer programmers in the industry. 
Furthermore, the program can be stored in a computer-readable recording medium. The 
recording medium includes a magnetic recording medium, an optical recording medium, and 
a radio medium. 

According to the present invention, the number of candidate approximation regions is reduced 
when searching under a varying distance measurement, which improves the search speed. 